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ABSTRACT 


The system of a cart inverted pendulum has many problems such as 


nonlinearity, complexity, unstable, and underactuated system. It makes this 
system be a benchmark for testing many control algorithm. This paper 
presents a comparison between 2 conventional control methods consist 
of a linear quadratic regulator (LQR) and pole placement. The comparison 


indicated by the most optimal steps and results in the system performance 
that obtained from each method for stabilizing a cart inverted pendulum 
system. A mathematical model of DC motor and mechanical transmission 
are included in a mathematical model to minimize the realtime 
implementation problem. From the simulation, the obtained system 
LOR performance shows that each method has its advantages, and the desired 


Optimal control pendulum angle and cart position reached. 
Pole placement 
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1. INTRODUCTION 

Inverted Pendulum is a regular pendulum facing upwards, opposite the direction of gravity. 
This system is naturally unstable. One way to stabilize this system is to move the pivot point with a certain 
speed and acceleration in the direction of the vertical axis. The system of inverted pendulum is also 
nonlinear. By giving the same speed and acceleration to the system, the response from the pendulum 
angle of the rod varies. The system of an inverted pendulum is also underactuated because it has 
one input in the form of a vertical motion at the pivot pomt but has two outputs in the form 
of rod angles and the position of the pivot point. The nature of this inverted pendulum system makes 
the researchers make this system the choice of a test or benchmark tool for testing many control techniques 
since the 1973s to the present [1-2]. The PID control [3-6], fuzzy genetic algorithm [7-10], state-space 
control with pole placement methods [11-13], lmear quadratic regulator (LQR) [14-16], linear quadratic 
gaussian (LQG) [17-20], observer [21-23], NCTL [24], flower pollination [25], and others have been 
used in this system. 

This paper discusses the LQR and pole placement methods for stabilizing a cart inverted pendulum 
system. System modeling using the Lagrange equation. The equation of particle motion expressed 
by the Lagrange equation can be obtained by reviewing the kinetic energy and potential energy of the system 
without the need to review the forces acting on the system. After deriving the mathematical model using 
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the Lagrange equation, the system of a cart inverted pendulum is brought to a state-space form for further 
control using the LQR and pole placement control methods. 


2. RESEARCH METHOD 

The system of a cart inverted pendulum is illustrated in Figure 1. There is a thrust f which causes 
the cart to move on the horizontal axis x. This force is generated by a DC motor. w,, w, is the rotational 
speed in the pulley transmission system, T,,T, is the variable torque in the pulley transmission system, 
l is the rod length, and m is the pendulum mass centered on the rod end. This system has one input in 
the form of f and two outputs, namely the pendulumangle (@) and the cart position (x). 


DC motor 


pulley transmission system 





Figure 1. Acart inverted pendulum system 


2.1. Mathematical model 
2.1.1.Lagrange equation on the system 

Mathematical modeling of an inverted pendulum system using the Lagrange equation. Lagrangian 
function £ is expressed by kinetic energy K and potential energy P as follows: 


L=K(q,q@ — P(q) (1) 
where q = {q4, ed eee, Ans: The desired equation of motion is derived by using: 

d (=) OL 2) 

dt\dg,/ ðq Qi 


where Q; indicates the external force at the q; coordinate. The cart moves in the direction i only, 
so the kinetic energy on the cart is expressed as: 


1 
Ky =3 Mx? (3) 


The pendulum moves in the direction i and j, so that the kinetic energy in the pendulum 
is expressed as: 


C= tee (4) 
m 2 0 


Xg =x +l sin@ and h = l cos 0, then the equation becomes: 


A mla? + 2x16 cos @ + 1767) (5) 


So that the total kinetic energy of the inverted pendulum cart systemis: 


1 1 
K= 5 Mx" + 5 mx? + 2x16 cos @ + 1767) (6) 
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The potential energy in the cart inverted pendulum systemis only found in the pendulum mass (n). 


P=mgh 
P = mgl cos 0 7) 


Lagrange equation is obtained as follows: 
1 1 , ; 
£=>Mx? +> m(x? + 2416 cos @ + 176?) — mgl cos 6 (8) 


The cart inverted pendulum system has two generalised coordinates namely x and @, so the Lagrange 
equation for each common coordinate is described as follows: 


d /ðL oL 
za) -aT 2) 
d /ðL OL 
alsa) - a6 = ° a 


After being derived, the general coordinates from the Lagrange equation of the system of a cart 
inverted pendulum can be rewritten to: 


f=(M+4+m)*%+ mlé cos 6 — ml67sing (11) 
0 = ml(% cos 6 — 6xsin@) + ml26 — (—mx16 sin@ + mgl sin@) (12) 
Equations (11) and (12) are nonlinear equations because they have trigonometric terms sin () and cos Q). 


The controller is designed to make @ as small as possible. With this assumption, the lmear Lagrange equation 
of the systemof a cart inverted pendulum becomes: 


f = (M +m) + mið (13) 
0 =ë + lö- go a 
2.1.2. DC motor 


The armature DC motor equivalent circuit is shown in Figure 2. The open-loop system diagram in 
the form of state space is shown in Figure 3. 





Figure 2. Armature DC motor Figure 3. Open-loop system 


The equation of the armarture DC motor circuit is 


di 
iaRat La a +v, = v, (15) 
dé o , . S 
where v, = K, —, w = —, T] = K, ia. So the equation of the amartur DC motor circuit becomes: 
dt dt 
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ta d*t dé, 


1 
ee ee 16 
K, 0 T aqo | “eae Me oe) 


Generally, inductors in small rotors so that the armature circuit equation becomes: 


Kp K; 
T, = —K, R, + R,” (17) 
The angular velocity of the motor w, is expressed in the speed of the cart x to be w, = = and K, = = SO 
1 a 


the equation becomes: 


r 


x 
= —|-K,—+ 18 
1 Ki Va (18) 


So the Lagrange equation at the x coordinate becomes: 





K K,.K. E 

y, = (M +m) + = — ml ġ (19) 

k (r) 
Simplified equations with c, = ot andc, = 7 becomes: 

1 1 

ee 1 . 

t= yg 2a — c,x — mg) (20) 
From (14), it can be obtained: 

a d 

Ö = — (g0 — ž) (21) 


l 


Equation (20) is substituted to (21) so thatit becomes: 


oo. sed 1 
8 =*(90 — (ean cë- mg) ) (22) 
l M 
: Gs ëi M +m)g 
Gaes uS a e 23 
mM Mi~ mM = 


The Lagrange equation for the system of a cart inverted pendulum has been obtained and subsequently 
converted into state space. 


2.1.2. State space representation 
The form of state space of a systemis represented as: 


x = Ax + Bu (24) 
y = Cx + Du (25) 


where x € RÍ is a vector state, u E€ R¥ is a control input, A € RÍ*Í is a system matrix, B € R’** input 

matrix, C € RP% is an output matrix, and y € R? is an output vector. In a cart inverted pendulum system, 
qr sin ea E 

the state vector is defined as x = [x Ox ål , derived from the state vector as x = [x 0 x òl , and the control 


input u = v,. From the Lagrange equation that has been obtained previously, the state space form 
of the systemof a cart inverted pendulum is; 





[o 0 1 0] [0] 
P 0 0 n Le ae ae 
A= ee Oa B=| 2 — — T D= 
oe in PE =|) 1 o o 75r ps0 26) 
M+m)g Cy _ £2 
[0 Ml Ml o] a 
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2.2. Controller design 
2.2.1. Linear quadratic regulator 

In the LQR method, the weight matrix Q and R are determined to get the most optimal response. 
LQR is one method of designing a modern control system. The purpose of LQR is to regulate or make 
the output of y be zero with minimum input. In order to bring the system state from the initial state 
x(t) = x(0) to the desired state x(t) by minimizing J. In general, LQR problems are: 


J=; xT ORE) + uT ) Ru(t)] (27) 


where Q € R”*” and R € R””™. Q and R each determine the performance and energy control needed. 
The optimal gain of feedback K can be obtained by; 


K = R`B'P (28) 
where P is a positive definite matrix which is a solution of the Riccati equation with: 
ATP + PA — PBR`'B'P+Q=0 (29) 


2.2.2. Pole placement 

Pole placement is a control method that is also used to get the most optimal response. 
Pole placement is also a modern control method. The location of the pole can be used as a reference to find 
out whether a system is stable or not. A stable system has a pole to the left of the imaginary axis, while 
a system that has a pole to the right of the imaginary axis is an unstable system. Therefore, there needs to be 
a control method to bring the pole which is to the right of the imaginary axis to the left of the imaginary axis 
so that the system becomes stable. Pole placement is a control method that can bring the pole to the right 
position. By using the gain feedback K, the desired design criteria of the systemcan be met. The polynomial 
characteristics can be described as: 


[sI — Al = s” + as ™1 + = + an-s + an (30) 


where I is the identity matrix and a is the polynomial coefficient. If the desired eigenvalues [u], , un] can 
be considered, then the desired characteristics of the polynomial are; 


JCs — u) = s” +x st +e HAX Ss HX (31) 
The feedback matrix gain required by K can be calculated using; 
K = (x, — an lXp-1> aņn-1l 0 lx- azl %1- a] T7 (32) 
The LQR method and pole placement use the feedback matrix. The feedback matrix in question can be 


described as the matrix K in Figure 4. To support the performance of the system, the pre-gain N matrix 
is also shown in Figure 5. 


y 
C 





Figure 4. Closed-loop system Figure 5. A closed-loop systemwith pregain 


Controllability and observabilty of the system is something that must be met before the control 
is done. The controllability of the system is indicated by rank (C) = n, where n is equal to the number 
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of state variables for all initial state values. The rank of observability (0) must also be the same as 
the number of states, 


C = [B|AB|A°B |- |A"7'B] (33) 
O = [C|CA|CA?|---|CA" 4] (34) 
3. RESULT AND ANALYSIS 


The parameters of the system of a cart inverted pendulum are determined for simulations in 
the Linear Quadratic Regulator and the pole placement methods as in Table 1. 


Table 1. Cart inverted pendulum parameters 


Parameters Value 
R; 1.25 Q 
ri 0.02 m 
K, 0.015 V.s/rad 
M 1.2 Kg 
K; 0.015 N.m/A 
l 0.3 m 


The controllability and observability of the system of a cart inverted pendulum is obtained namely 
rank (€) =rank(O) = 4 so that control can be done with; 


[1 0 0 0 ] 
|o 1 0 0 | 
0 0.5  —0.1875 2.795 [o 0 1 0o | 
e=| 9 1667 0.625 -63.81 oui 0 0 1 | 
05  —0.187 2.795 —2.07 lo -1635 -0.3735 o | 
—1.667 0.625  -—63.81 27.34 lo 38.15 1.25 0 
10 0.6131 0.1406 —1.635 | 

0 —2.0438 -0.4688 38.15 


It is known that one of the poles of the systemis to the right of the imaginary axis. The pole is worth 
(0.6.1512, —6.2049, —0.3213) so control is needed. Control is done with the desired output in the form 
of a pendulum angle at the balance pomt and the position of the cart is at 0.3 meters. The pole -zero mapping 
from the systemof a cart inverted pendulum is shown in Figure 6. 


Pole-Zero Map 





0.8 
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2 
> 


Imaginary Axis (seconds”') 
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Figure 6. Pole-zero mapping 
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3.1. Simulation of the LQR method 


The simulation using the LQR method uses 5 variations of the weight matrix Q and R as in Table 2. 


The weighting matrix Q = [q,,q2,q43,q,| that is changed is q,,q, because that affects state x, 0 directly. 
The combination produces a feedback matrix (K) and pre-gain (N) as Table 3. From the 5 configurations in 


Table 2 and Table 3, the systemresponse was obtained as shown in Figures 7, 8, and 9. 


Table 2. LOR parameter configuration 


No qi q2 R 
1 0.1 1 1 
2 1 1 1 
3 1 2 1 
4 l 2 10 
5 2 2 10 


Table 3. N, K parameters 








No N K1 K2 K3 K4 
] -0.3162 -0.3162 -50.5224 -2.2810 -8.2416 
2 -1.0000 -1.0000 -53.6364 -3.3636 -8.7875 
3 -1.0000 -1.0000 -53.6561 -3.3644 -8.7892 
4 -0.3162 -0.3162 -50.5062 -2.2807 -8.2402 
5 -0.4472 -0.4472 -51.2154 -2.5234 -8.3645 
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Figure 7. The angle response ofthe pendulum with the LOR method 
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Figure 8. Cart position response with the LQR method 
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Figure 9. Control signal with the LQR method 


From Figure 7, 8, and 9 it can be seen that the greater the Q and R weighting matrix, the faster 
the system response to reach the desired state and the greater the control signal. However, by choosing 


a small R, the control signal can be minimized. Therefore, choosing the nght Q and R matrix will produce 
an optimal controller. 


3.2. Pole placement method simulation 


Simulation using the pole placement method uses 5 variations of the pole matrix S4, S2,S3, S4 as in 
Table 4. Poles are chosen randomly with negative values. The combination produces a feedback matrix (K) 
and pre-gain (N) as in Table 5. 


Table 4. Pole placement parameters 


No Sı S> S3 S4 

1 -6.18+0.19i  -6.18-0.19i -0.48+0.431 -0.48-0.43i 
2 -6.1840.191 -6.18-0.19% -0.48+0.43i -0.489 

3 -6.1840.191 -6.18-0.19% -0.48+0.43i -0.189 

4 -6.1840.191 -6.18-0.19% -0.48+0.431 -0.089 

5 -6.1840.19 -6.18-0.1% -0.48 -0.089 


Table 5. N, K parameters 


No N K1 K2 K3 K4 

1 -1.0000 -1.0000 -53.65 -3.3644 -8.7891 
2 -0.56+0 491 -0.56+0.491 -53.4143.481 -3.22+1.17i -8.74+0.6 li 
3 -0.21+0.19i -0.24+0.19i -50.99+3.32i -2.41+1.07i -8.32+0.581 
4 -0.10+0.091 -0.10+0.091 -50.18+43.261 -2.134+1.041 -8.18+0.571 
5 -0.1029 -0.1029 -50.1898 -2.1388 -8.1815 


From the 5 configurations in Table 4 and Table 5, the system response is obtained as shown in 


Figures 10, 11 and 12. From Figures 10, 11, and 12 it can be seen that the increasingly negative pole position 
produces an increasingly rapid response at the angle of the pendulum, but the overshoot also gets bigger. 
By looking at the control signal from each pole, the optimal controller can be selected. 
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<10 Step Response with Pole Placement Control with Gain 
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Figure 10. The angle response of the pendulum using the pole placement method 
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Figure 11. Response cart with the pole placement method 
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Figure 12. Control signal with the pole placement method 
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4. CONCLUSION 

Linear quadratic regulator and pole placement for stabilizing a cart inverted pendulum system has 
been successfully conducted. By using LQR, the system response can be obtained by selecting the weight 
matrix Q and R which is then continued with the calculation to get the desired pole. With the pole placement 
method, the system response can be chosen by directly selecting the desired pole, although it is necessary to 
repeatedly determine the pole to get the most optimal results. Both methods can produce optimal system 
response with fast systemresponse and minimal control signals. 
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